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Genome- Wide Mutant Fitness Profiling Identifies Nutritional 
Requirements for Optimal Growth of Yersinia pestis in Deep Tissue 
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ABSTRACT Rapid growth in deep tissue is essential to the high virulence of Yersinia pestis, causative agent of plague. To better 
understand the mechanisms underlying this unusual ability, we used transposon mutagenesis and high-throughput sequencing 
(Tn-seq) to systematically probe the Y. pestis genome for elements contributing to fitness during infection. More than a million 
independent insertion mutants representing nearly 200,000 unique genotypes were generated in fully virulent Y. pestis. Each 
mutant in the library was assayed for its ability to proliferate in vitro on rich medium and in mice following intravenous injec- 
tion. Virtually all genes previously established to contribute to virulence following intravenous infection showed significant fit- 
ness defects, with the exception of genes for yersiniabactin biosynthesis, which were masked by strong intercellular complemen- 
tation effects. We also identified more than 30 genes with roles in nutrient acquisition and metabolism as experiencing strong 
selection during infection. Many of these genes had not previously been implicated in Y. pestis virulence. We further examined 
the fitness defects of strains carrying mutations in two such genes — encoding a branched-chain amino acid importer (brnQ) and 
a glucose importer (ptsG) — both in vivo and in a novel defined synthetic growth medium with nutrient concentrations matching 
those in serum. Our findings suggest that diverse nutrient limitations in deep tissue play a more important role in controlling 
bacterial infection than has heretofore been appreciated. Because much is known about Y. pestis pathogenesis, this study also 
serves as a test case that assesses the ability of Tn-seq to detect virulence genes. 

IMPORTANCE Our understanding of the functions required by bacteria to grow in deep tissues is limited, in part because most 
growth studies of pathogenic bacteria are conducted on laboratory media that do not reflect conditions prevailing in infected 
animal tissues. Improving our knowledge of this aspect of bacterial biology is important as a potential pathway to the develop- 
ment of novel therapeutics. Yersinia pestis, the plague bacterium, is highly virulent due to its rapid dissemination and growth in 
deep tissues, making it a good model for discovering bacterial adaptations that promote rapid growth during infection. Using 
Tn-seq, a genome-wide fitness profiling technique, we identified several functions required for fitness of Y. pestis in vivo that 
were not previously known to be important. Most of these functions are needed to acquire or synthesize nutrients. Interference 
with these critical nutrient acquisition pathways may be an effective strategy for designing novel antibiotics and vaccines. 
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Serendipitous quirks of Yersinia biology catalyzed much of the 
research into the molecular basis of Y. pestis pathogenesis in 
the last three decades. For example, the finding that the addition of 
skim milk to the growth medium enhanced retention of virulence 
of Y. pestis cultured at 37°C (1) ultimately led to the discovery of 
type III secretion (2). The identification of the yersiniabactin sid- 
erophore system required for bacterial iron acquisition during 
infection (3, 4) was similarly prefaced by the observation that the 
ability of Y. pestis to form pigmented colonies on medium con- 
taining hemin was somehow linked to virulence (5, 6). A large 
fraction of current research on Yersinia virulence centers exclu- 
sively on the few virulence determinants discovered by these early 
efforts. To date, these studies have largely eclipsed more system- 
atic attempts to define virulence determinants and other impor- 
tant factors contributing to severity of disease. 



The limitations of previous iterations of whole-genome tech- 
niques are partly responsible for this focus. Prior genome-level 
studies of Y. pestis relied on acquisition of gene expression data 
under various in vitro and in vivo conditions ( 7- 1 1 ) . These results 
provide a deeper understanding of physiological adaptations em- 
ployed by the bacterium in different environments; however, gene 
expression data have not proved to be a reliable predictor of viru- 
lence determinants (12-14). Attempts to circumvent this problem 
by directly measuring the functional contribution of Y. pestis 
genes to bacterial growth in vivo have been more successful but 
rely on measuring the fitness of a defined panel of mutants rather 
than implementing a truly unbiased genome-scale technique (15). 

Recently, the combination of transposon mutagenesis and 
high-throughput sequencing has provided a more direct approach 
to functional genomics. This method has been assigned several 
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FIG 1 Dense transposon mapping of the Y. pestis genome allows identification of regions under selection. The number of insertions sequenced at each site is 
represented by bar height (log scale). Circular plots were generated using the software program CGView (62). (A) Transposon insertions mapping to the 
chromosome. Large gaps correspond to blocks of essential genes, such as the NADH dehydrogenase subunits (inset; insertions are shown by the Integrated 
Genomics Viewer software tool [63, 64] on a log scale). (B) Transposon insertions mapping to the type III secretion plasmid pCDl. Insertions in many genes in 
the first 30 kb of pCDl, which encode regulatory and structural components of the type III secretion system, are selected against in vivo (outer ring; red bars) but 
not in vitro (inner ring; blue bars). The gap of insertions at the 60-kb mark corresponds to plasmid replication genes. 



names, including HITS (16),TraDIS (17), INSeq (18), andTn-seq 
(19). Tn-seq employs deep sequencing to map and count trans- 
poson/genome junctions from a dense library of random trans- 
poson insertion mutants grown en masse under each condition of 
interest. The number of transposon insertions detected in a given 
gene reflects the prevalence of the corresponding mutants in the 
library following selection (20). This information can be used to 
estimate relative fitness for mutants in different growth environ- 
ments. 

Tn-seq is an increasingly popular tool in studies of bacterial 
pathogens ( 14, 16-3 1 ), although to date only five pathogens have 
been examined using animal infection as a selection regime {Strep- 
tococcus pneumoniae [23, 26], Haemophilus influenzae [16], Yer- 
sinia pseudotuberculosis [25], uropathogenic Escherichia coli [30], 
and Mycobacterium tuberculosis [31]). Here, we used Tn-seq to 
measure the fitness of nearly 200,000 unique Y. pestis insertion 
mutants during growth in vitro in rich culture medium and in 
mice following intravenous inoculation. We identified several 
genes contributing to bacterial fitness during infection that had 
not previously been implicated in virulence, including more than 
30 genes with roles in nutrient import and metabolism. The ma- 
jority of known virulence factors required for optimal bacterial 
growth during infection were also required in the Tn-seq data set, 
underscoring the suitability of this method in the context of infec- 
tion studies. 

RESULTS 

Transposon mapping of the Yersinia pestis genome. We mu- 

tagenized the fully virulent Y. pestis strain KIM1001 (see Text SI in 



the supplemental material) with a WmarZ -derived transposon to 
construct a bank of approximately 1.5 million independent inser- 
tion mutants. To select against mutants harboring insertions in 
genes required for growth in vitro, the insertion library was grown 
on TB, a complex medium adapted from tryptose blood agar base 
that provides a broadly permissive growth condition for insertion 
mutants. 

To select against mutants harboring insertions in genes re- 
quired for growth during mammalian infection, we performed 
intravenous infections of three C57BL/6 mice with 2.3 X 10 7 CFU 
of the insertion library, which resulted in a population of ~10 6 
bacteria in each spleen 1 h postinfection. This infection route pre- 
serves library diversity by circumventing bottlenecks imposed by 
peripheral routes of infection. Bacteria were recovered from 
spleens 40 h postinfection by plating on TB agar. Bacterial popu- 
lations in the spleen at the time of harvest averaged 1.63 X 
10 9 CFU/spleen (ranging from 1.4 X 10 9 to 2.1 X 10 9 ). This pop- 
ulation expansion is in excess of 1,000-fold, large enough to per- 
mit detection of modest reductions in fitness even in the absence 
of bacterial killing by host defenses. 

Following selection, we mapped the transposon insertions 
present in the bacterial populations recovered from the in vitro 
and in vivo expansions via selective amplification of transposon- 
genome junctions and Illumina sequencing, using methods simi- 
lar to those previously described (see Text SI in the supplemental 
material). The himarl transposon derivative used for mutagenesis 
inserts at TA dinucleotides (32, 33). Approximately 69% (187,780 
of 271,354) of TAs in the Y. pestis KIM genome harbored inser- 
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FIG 2 Selection of chromosomal Y. pestis genes during growth in vitro on rich medium. (A) The relative fitness of mutants in each chromosomal ORF plotted 
against significance (calculated by resampling; see Text SI in the supplemental material). Genes harboring no sequenced insertions in the central 90% of the ORF 
(relative fitness = 0) are shown in the box at the left of the plot. (B) Relative fitness (histogram) correlates well to essentiality, as predicted by the hidden Markov 
model recently published by Dejesus and Ioerger (36) (red, essential; yellow, growth-deficient; green, nonessential) . Genes harboring no sequenced insertions in 
the central 90% of the ORF (relative fitness = 0) are shown at the left of the plot. (C) Three examples of genes with modest reductions in relative fitness that were 
predicted to be essential by the hidden Markov model. Many genes in this class contain groups of consecutive TA sites that do not harbor insertions (red boxes), 
suggesting domain architecture. For example, the apparently essential N terminus of the rne gene is highly homologous to the essential E. coli gene of the same 
name (65, 66), while the evidently nonessential C terminus is highly divergent from the E. coli homologue, as has been reported for several other species (67); also 
shown is the apparently essential C-terminal DNA-binding domain of the rcsB gene, a transcriptional activator with multiple roles, including small RNA 
production and cell division (68). Insertions are shown by the Integrated Genomics Viewer software tool (63, 64) on a log scale. 



tions, an average density of 39 insertions per kilobase. This high 
density of insertions allowed fine-scale mapping of functional ge- 
netic units, including intergenic regions (Fig. 1A). The density of 
mutations on pMTl, pCDl, and pPCPl, the three plasmids pres- 
ent in Yersinia pestis KIM, was similar to that on the chromosome 
(78% on pMTl, 76% on pPCPl, and 63% on pPCPl, compared to 
69% on the chromosome) (Fig. IB). Although substantial large- 
scale heterogeneity in insertion density has been observed after 
similar transposon mutagenesis of other pathogens (21, 34), in- 
sertions in our library were distributed relatively evenly across the 
chromosome (Fig. 1A). 

Identifying genes required for optimal growth in vitro. To 
identify genes required for optimal growth of Y. pestis on rich 
medium, we analyzed the fall set of Y. pestis KIM open reading 
frames (ORFs) annotated in the NCBI database by two distinct 



mathematical approaches for measuring the effect of insertions on 
fitness. 

In the first approach, we estimated the relative fitness ( W) of 
mutants harboring insertions in each chromosomal ORF, using 
an analysis similar to that described by van Opijnen and Camilli 
(35). In the context of evolutionary biology, relative fitness is de- 
fined as the ratio of the rates of population expansion for the two 
genotypes being compared. Mutants with relative fitness of < 1 are 
defective in growth or survival relative to the wild type, while 
mutants with relative fitness of > 1 are more fit than the wild type. 
To test the statistical significance of the observed changes in fitness 
for each gene, we employed a nonparametric approach based on a 
reference distribution created by resampling insertion frequencies 
in TA sites in nonessential ORFs (defined as those with relative 
fitness between 0.95 and 1.05) and plotted relative fitness versus 
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FIG 3 Comparison of Tn-seq data sets to find chromosomal genes required during mammalian infection. (A) Volcano plot for identification of chromosomal 
genes selected in vivo. Each gene was compared between two in vitro data sets (filled pale blue squares) or between an in vitro data set and the in vivo data set (gray 
circles). Genes experiencing significant selection in vivo are represented by black circles. For each gene, relative fitness of insertion mutants in vivo is given with 
respect to the fitness of a wild-type strain. Significance is calculated by permuting the number of insertions sequenced at each TA in the gene between the two data 
sets (see Text SI in the supplemental material). Genes selected in vivo appear in the upper-left quadrant, while any genes that harbor more insertions in vivo than 
in vitro should track to the upper-right quadrant. Genes harboring no sequenced insertions in vivo in the central 90% of the ORF (relative fitness = 0) are shown 
in the box at the left of the plot. (B) Genes experiencing strong selection (psn and brnQ) harbor many fewer insertions after growth in vivo. The same effect is 
present but less pronounced in genes experiencing moderate levels of selection (ptsG). Insertions are shown by the Integrated Genome Viewer tool (63, 64) on 
a log scale. 



significance (Fig. 2A) (for method details, see Text SI in the sup- 
plemental material; for complete results, see Table SI). 

A drawback of this first approach is its insensitivity to ORFs 
containing multiple functional domains with differing contribu- 
tions to fitness. To address this problem, we used the hidden 
Markov model (HMM) recently developed by Dejesus and Io- 
erger (36). This method classifies genes as essential, growth disad- 
vantaged, nonessential, or growth advantaged. A complete list of 
classification of ORFs by the HMM algorithm is available in Ta- 
ble S2 in the supplemental material, and the metabolic pathways 
most commonly represented by this list of genes, as mapped by the 
Kyoto Encyclopedia of Genes and Genomes (37), are summarized 
in Table S3. Relative fitness values were good predictors of HMM 
class (Fig. 2B), indicating general agreement of these two meth- 
ods. However, a small number of genes classified as essential or 
growth deficient by the HMM were assigned relatively high rela- 
tive fitness values (above 0.75). 

In some cases, these results are consistent with known domain 
structure; this algorithm is particularly sensitive to consecutive TA 
sites without insertions and therefore correctly classifies ORFs 
containing subgenic essential domains (Fig. 2C). However, other 
genes in this group have insertion patterns that render their clas- 
sification suspect (discussed fully in Text S2 in the supplemental 
material). These include genes with relatively short (<20) runs of 
insertion-free TAs, which are less likely to represent functional 
protein domains, and runs of insertion-free TAs in unusually AT- 
rich sequences, which may be more difficult to sequence due to 
base composition bias in library preparation and/or Illumina se- 
quencing (38). 

The HMM analysis also identifies some plasmid genes as es- 
sential, including genes required for plasmid replication. The 



apparent selection on these genes does not stem from any essential 
function of the plasmids but instead reflects the biological impos- 
sibility of propagating insertions that prevent the replication of 
their host genetic element. The plasmid pCDl, which encodes the 
Y. pestis type III secretion system (T3SS), harbors a second class of 
genes that the HMM scores as essential or growth deficient. The 
majority of these are involved in regulating type III secretion 
(T3S) activity (e.g., yscM, IcrGVH, and IcrD), and insertions in 
these genes are underrepresented because uncontrolled expres- 
sion of the T3SS at 37°C in vitro inhibits growth of Y. pestis (e.g., 
see reference 39). Curiously, the pattern of T3SS genes selected 
in vitro deviates somewhat from current models of T3S regulation. 
In particular, the regulatory gene IcrE (sometimes called yopN) 
was not subject to significant selection under our in vitro condi- 
tion (falling into the nonessential HMM class), and we observed 
unexpectedly strong selection against yopB mutants in vitro. We 
do not conceive a simple explanation for these unexpected phe- 
notypes and suggest that this regulatory scheme may require fur- 
ther investigation. 

Identifying genes selected during infection. Because we were 
particularly interested in identifying genes required for optimal 
growth and survival during infection, we calculated the relative 
fitness of insertion mutants from the in vivo Tn-seq data set. To 
determine whether the period of growth in vivo significantly 
changed the abundance of insertion mutants for each gene, we 
developed a nonparametric statistic based on a reference distribu- 
tion created by permuting insertion frequencies in the relevant 
ORF from both the in vitro and in vivo data sets (see Text SI in the 
supplemental material) and plotted the relative fitness versus sig- 
nificance (Fig. 3A). A complete table of these values is provided in 
Table SI. Rare genes with substantial reductions in in vivo relative 
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TABLE 1 Chromosomal Y. pestis 


genes experiencing significant selection in deep tissue 






Gene name 


Gene function 


Relative fitness in vivo 


P value 


arcB (y0126) 


Aerobic respiration control 


0.798 


2.00E-04 


y0160 


Unknown function 


0.681 


2.90E-03 


pyrB (y0161) 


Pyrimidine biosynthesis 


0.566 


2.20E-03 


purD (y0501) 


Purine biosynthesis 


0.345 


4.00E-04 


purH (y0502) 


Purine biosynthesis 


0.316 


l.OOE-05 


miaA (y0629) 


IPP transferase to tRNA 


0 


2.80E-03 


hflK (y0633) 


FtsH protease regulator 


0.738 


l.OOE-05 


y0661 


Conserved domain of unknown function 


0.647 


l.OOE-04 


pcm (y0832) 


Protein L-isoaspartate O-methyltransferase 


0.677 


2.60E-03 


brnQ (y0981) 


Branched-chain amino acid import 


0 


l.OOE-05 


purJ<:(yll02) 


Purine biosynthesis 


0.298 


l.OOE-05 


pwr£ (yll03) 


Purine biosynthesis 


0.261 


6.00E-04 


purL (yl309) 


Purine biosynthesis 


0.168 


l.OOE-05 


guaB (yl362) 


Purine metabolism 


0 


6.00E-04 


guaA (yl363) 


Purine metabolism 


0 


4.20E-03 


purN (yl406) 


Purine biosynthesis 


0.534 


1.70E-03 


purM (yl407) 


Purine biosynthesis 


0 


1.50E-03 


yl412 


Putative zinc-dependent peptidase 


0.758 


l.OOE-05 


purC (yl421) 


Purine biosynthesis 


0 


1.00E-03 


yl483 


Putative histidine kinase sensor 


0.661 


l.OOE-05 


purF (yl605) 


Purine biosynthesis 


0.496 


l.OOE-05 


ptsG (yl767) 


Glucose importer 


0.693 


l.OOE-05 


(yl777) 


NADH dehydrogenase 


0.589 


l.OOE-05 


yl920 


LPS modification (arnD homologue) 


0.317 


4.00E-04 


yl940 


Contains putative L.D-transpeptidase domain 


0.548 


4.10E-03 


tonB (y2037) 


Membrane transport 


0 


1.12E-02 


pyrF (y2069) 


Pyrimidine biosynthesis 


0 


2.00E-02 


yfetP (y2397) 


Siderophore import 


0 


3.78E-02 


yfctA (y2398) 


Regulator of siderophore biosynthesis/import 


0 


l.OOE-05 


psn (y2404) 


Siderophore import 


0.160 


l.OOE-05 


cycA (y2447) 


D-Alanine/ D-serine/glycine permease 


0.709 


3.50E-03 


pabB (y2535) 


para-aminobenzoate synthesis 


0.690 


1.00E-04 


nroP (y2564) 


Aromatic amino acid import 


0.777 


1.00E-05 


WsF (y2627) 


Histidine biosynthesis 


0.570 


6.00E-04 


y2739 


Putative ATP-dependent protease 


0.591 


7.00E-04 


aspC (y2760) 


Amino acid metabolism/biosynthesis 


0.706 


l.OOE-04 


aroA (y2783) 


Aromatic amino acid biosynthesis 


0 


2.00E-03 


cydD (y2803) 


Cysteine/glutathione transport 


0.799 


2.00E-04 


cydC (y2805) 


Cysteine/glutathione transport 


0.815 


5.00E-04 


nrdA (y2974) 


Nucleotide biosynthesis 


0.713 


l.OOE-05 


nrdB (y2975) 


Nucleotide biosynthesis 


0 


l.OOE-05 


mefN(y3104) 


Methionine transport 


0 


l.OOE-05 


y3105 


Methionine transport 


0 


l.OOE-05 


metQ (y3106) 


Methionine transport 


0 


l.OOE-05 


ffepA (y3651) 


Thiamine transport 


0.545 


l.OOE-05 


thiP (y3652) 


Thiamine transport 


0.742 


l.OOE-05 


f/iiQ (y3653) 


Thiamine transport 


0 


7.20E — 03 


nhaA (y3704) 


Na + /H + antiporter 


0.375 


l.OOE-05 


y3803 


GTP-binding factor 


0.741 


4.00E-04 


hemN (y3807) 


Porphyrin metabolism 


0.579 


l.OOE-05 


env-Z (y3917) 


Histidine kinase osmosensor 


0.741 


1.60E-03 


pabA (y3953) 


para-aminobenzoate synthesis 


0.468 


1.60E-03 


nro£ (y4028) 


Aromatic amino acid biosynthesis 


0 


8.80E-03 



fitness that do not reach significance (Fig. 3A, lower left of plot) 
are associated with large growth defects both in vitro and in vivo 
(see Text SI). 

The PCR amplification and Illumina sequencing steps required 
for Tn-seq introduce variability into the measured relative abun- 
dance of each mutant in the library, leading to differences in mea- 
sured abundances between libraries that may reflect either biolog- 
ically meaningful differences or noise introduced during sample 
processing. To address this problem, we independently prepared, 



sequenced, and analyzed a second sample of the inoculum library 
(see Table SI in the supplemental material). The comparison of 
these two technical replicates allowed us to directly observe the 
extent of artifactual variation introduced by sample processing 
(Fig. 3A). ORFs in the in vivo data set that met a strict P value 
cutoff of 0.05 and that fell entirely outside the range of variability 
observed between inoculum library replicates were considered 
candidates for significantly selected genes 

Fifty-three chromosomal genes met these criteria for signifi- 
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FIG 4 Y. pestis requires the branched- chain amino acid importer BrnQ for virulence in mice. (A) Left: subcutaneous infection with 1,000 CFU KIM1001 (w.t.) 
(jj = 9), KIMlOOlAbmQ (AbrnQ) (n = 17), or the complemented strain KIM100lAbriiQ(pSP6) (complemented) (n = 5). Mice infected with KIM1001 or 
KIM100lAbrnQ(pSP6) died within the first week of infection, while the mutant KIMlOOlAbmQ strain failed to sicken or kill mice. Right: previous infection with 
KIM1001 AbrnQ protected 15 out of 16 mice from subcutaneous challenge with 1,000 CFU KIM1001 28- to 30 days later. (B) Growth of the attenuated strains 
1G150A (w.t.), JG150AAbr«Q (AbrnQ), and IG150AAbrnQ(pSP6) (complemented) in the defined serum-like medium SNM alone (left) or supplemented with 
2 mM leucine, isoleucine, and valine (right). Curves are representative of two independent experiments. 



cant selection in vivo (Table 1), many of which fell into common 
KEGG pathways (see Table S3 in the supplemental material). 
These pathways include purine and pyrimidine biosynthesis, me- 
tabolism of both aromatic and certain nonaromatic amino acids, 
biosynthesis of secondary metabolites, and the ABC transporter 
category. Genes selected in vivo also include hypothetical genes, 
genes of unknown function, and annotated genes with putative 
functions that have not previously been implicated in Y. pestis 
infection. Notable among these are y0661, a hypothetical gene 
containing a conserved domain of unknown function, and y 1920, 
encoding a homologue of the lipopolysaccharide (LPS) modifica- 
tion enzyme ArnD, which was recently reported to be selected in 
the closely related species Yersinia pseudotuberculosis during infec- 
tion (25) (Table 1). 

Branched-chain amino acid transport by BrnQ is required 
for Y. pestis virulence. The putative branched-chain amino acid 
importer encoded by brnQ (y0981) was among the most strongly 
selected genes in vivo, despite its apparent dispensability in vitro 
(Fig. 3A and B). A homologous importer has been reported to play 
a role in valine uptake during intracellular growth of Chlamydia 
trachomatis (40). To validate this observed in vivo fitness defect, 
we constructed an in-frame deletion of brnQ in the virulent strain 
KIM 1001. Subcutaneous infection with 10 3 (Fig. 4A) or 10 5 (see 
Fig. SI in the supplemental material) CFU of the KIMlOOlAfcraQ 
mutant failed to visibly sicken mice, while all mice infected with 
wild-type KIM1001 died within 8 days. Sixty percent of mice suc- 
cumbed to infection with 10 7 CFU of this mutant (see Fig. SI), 
indicating a roughly 1-millionfold increase in mean lethal dose in 
the absence of brnQ (since the 50% lethal dose [LD 50 ] of wild-type 



KIM 1001 is <50 CFU by subcutaneous infection [41]). Comple- 
mentation with the brnQ gene driven by its native promoter on the 
low-copy-number plasmid pSP6 restored virulence (Fig. 4A). 
Four days after subcutaneous infection with 1,000 CFU 
KIM 1001 AbrnQ, no Y. pestis could be recovered from the homog- 
enized spleens of mice (n = 5; limit of detection, <10 1 CFU/ 
spleen). 

To determine if infection with brnQ mutants could protect 
against subsequent infection, mice surviving infection with 
KIMlOOlAbmQ (a dose of 10 3 CFU) were challenged 28 or 
30 days after the initial infection with 10 3 CFU of wild-type 
KIM 1001. Fifteen of the sixteen mice previously infected with 
KIM 1001 AbrnQ survived the challenge without visible symp- 
toms, while all naive mice succumbed (Fig. 4A). 

Notably, brnQ mutants were not underrepresented in the in 
vitro Tn-seq data set. To investigate this phenomenon, we recon- 
structed the in-frame deletion of brnQ in the attenuated strain 
JG150A, allowing us to investigate growth of the mutant under 
biosafety level 2 (BSL2) conditions. In accordance with the in vitro 
Tn-seq data, IG150AAbrnQ grew as well as the wild-type control 
in the rich medium TB and had only a mild growth defect in 
PMH2, a defined medium commonly used for culturing Y. pestis 
(42) (Table 2; see also Fig. S2A and B in the supplemental mate- 
rial). Since brnQ is annotated as an ABC transporter that imports 
the branched-chain amino acids leucine, isoleucine, and valine, 
we formulated a new defined medium for culturing Y. pestis to 
probe for a deficiency in branched-chain amino acid acquisition. 
Serum nutritional medium (SNM) contains levels of magnesium, 
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TABLE 2 The brnQ deletion mutant grows poorly at low 
concentrations of branched-chain amino acids 





Growth rate" 




Relative 


Growth medium 


JG150A 


JG150AAferaQ 


fitness (W) b 


TB 


0.5450 ± 0.023 


0.5301 ± 0.044 


0.973 


SNM 


0.3987 ± 0.075 


0.0681 ± 0.023 


0.171 


SNM + 2 mM Leu, He, Val 


0.4063 ± 0.002 


0.3253 ± 0.001 


0.801 



a Growth rates of the wild-type JG150A strain and the deletion mutant JG150AA^r«Q 
were calculated from cultures at mid-log phase. Means and standard errors from a 
minimum of two independent experiments are shown. 

h In these experiments, relative fitness was calculated by taking the ratio of the 
JG150AAbrnQ growth rate and the TG150A growth rate. 



sodium, glucose, and all amino acids equivalent to those found in 
serum (see Text SI). 

The growth rate of JG150AAbrnQ cultured in SNM is less than 
one-fifth of the growth rate for wild- type JG150A in the same 
medium (relative fitness, 0.171). As in infection studies, comple- 
menting brnQ with a wild-type copy of the gene under its native 
promoter in trans restored growth of the mutant to wild-type 
levels (Fig. 4B). Growth could also be restored by supplementing 
SNM with 2 mM leucine, isoleucine, and valine — concentrations 
similar to those in PMH2 (Fig. 4B and Table 2). Interestingly, 
supplementing the medium with isoleucine and valine in the ab- 
sence of leucine did not fully restore growth (see Fig. S2C in the 
supplemental material), despite the presence of the apparently 
functional alternative leucine uptake system LS in Y. pestis KIM 
(HvKHMGF; y0422-y0426) (43) that has been reported to be ex- 
pressed in vivo (7). 

Glucose import by PtsG provides a competitive advantage to 
Y. pestis during infection. Recent work screening a defined panel 
of Y. pestis mutants implicated the glucose importer encoded by 
ptsG in bacterial fitness during infection (15). As with brnQ, the 
Tn-seq data set identified ptsG as significantly selected in vivo, but 
insertions in ptsG caused a much more modest reduction in rela- 
tive fitness (0.693) (Table 1 and Fig. 3A and B). To assess the 
contribution of ptsG to virulence, we subcutaneously infected 
C57BL/6 mice with 1,000 CFU of the in-frame deletion strain 
KIMlOOlApfsG. The course of disease with KIMlOOlApfsG did 
not differ significantly from that with wild-type KIM 1001 (Fig. 5). 
We therefore considered the hypothesis that the decrease in 



100 




IM 1 ' — i 

5 10 
days post-infection 

FIG 5 The glucose importer encoded by ptsG has little effect on Y. pestis 
virulence. Subcutaneous infections with 1,000 CFU KIM1001 (w.t.) (n = 10) 
or KIMlOOlApfsG (AptsG) (n = 14) were carried out. Thirteen out of fourteen 
mice infected with KIMlOOlApfsG died, with kinetics similar to those of mice 
infected with KIM 1001. 



growth caused by inactivation of ptsG is insufficient to affect the 
outcome of infection but that ptsG confers a competitive fitness 
advantage to Y. pestis during growth in mammals. 

We infected BALB/C and C57BL/6 mice intravenously with 
10 4 CFU of KIM 1001 and KIMlOOlApfsG in a 1:1 mixture. 
Spleens were homogenized 2 and 42 h postinfection, and the pro- 
portion ofpfsG mutants in the spleen was determined at each time 
point. The relative frequency of KIMlOOlApfsG mutants declined 
significantly over the course of the experiment (Fig. 6A). On av- 
erage, the proportion of KIMlOOlApfsG mutants in the spleen 
decreased from 0.434 at 2 h postinfection (i.e., 43.4% mutants) to 
0.039 (3.9%) at 42 h postinfection. Assuming constant exponen- 
tial growth rates during the infection, this change in frequency 
corresponds to a relative fitness of 0.8 for KIMlOOlApfsG relative 
to wild-type KIM 1001, a value close to that determined form the 
Tn-seq experiments (Table 1). 

We were able to recapitulate this phenotype in vitro in the 
defined serum-like medium SNM. Five replicate cultures were 
inoculated with equal numbers of the attenuated (T3SS-deficient) 
strains IG 102 and JG 1 02 AptsG. After 42 h of continuous log-phase 
growth, the proportion of pfsG mutants in the culture had de- 
creased from 0.5 to an average of 0.034 (Fig. 6B) . This reduction in 
frequency represents a relative fitness of 0.794 for thepfsG mutant 
relative to the control strain, IG102, in the same medium, very 
similar to the value of 0.8 observed in vivo. No growth defect was 
detected in SNM when the IG102ApfsG mutant was comple- 
mented with a low-copy-number plasmid carrying pfsG under 
control of its native promoter (see Fig. S3 and Text SI in the 
supplemental material). 

Because PtsG is annotated as a glucose importer, we tested 
whether glucose is required for the competitive advantage pro- 
vided by pfsG. No fitness defect was detected for the pfsG mutant 
grown in the absence of glucose (Fig. 6B), consistent with the 
apparently neutral phenotype of pfsG insertion mutants in the 
in vitro Tn-seq data set (Fig. 3B). In contrast, the proportion of 
pfsG mutants in cultures containing glucose decreased on average 
from 0.592 to 0.266 (Fig. 6B). This decrease corresponds to a rel- 
ative fitness of 0.92 for IG102ApfsG relative to wild-type IG102 in 
this rich culture medium, supporting the hypothesis that pfsG 
provides a competitive advantage to Y. pestis during infection by 
permitting or increasing glucose import. 

Intercellular complementation in vivo: genes required for 
siderophore-mediated iron acquisition. Growth in vivo strongly 
selected against insertions in the psn gene (y2404), which encodes 
the outer membrane TonB-dependent yersiniabactin importer 
(44), and mybtA (y2398), the regulator required for expression of 
psn and other genes in the yersiniabactin uptake system (45) (Ta- 
ble 1 and Fig. 3A and B). This agrees with an extensive body of 
literature describing the requirement for yersiniabactin-mediated 
iron acquisition for optimal growth under iron-limiting condi- 
tions (44, 46), including in vivo (4, 6, 47). 

However, the genes required for biosynthesis of the yersini- 
abactin siderophore (Ybt) were not selected in vitro or in vivo (see 
Table S4 in the supplemental material). This apparent contradic- 
tion almost certainly results from intercellular complementation. 
Extracellular yersiniabactin has been reported to complement the 
growth of mutants deficient in yersiniabactin biosynthesis even in 
low-iron environments (44). Because our experiments were per- 
formed at a relatively high bacterial density and the vast majority 
of the bacterial population remains functionally wild type with 
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FIG 6 ptsG provides a competitive advantage in the presence of glucose. (A) A 10 4 -CFU dose of wild-type KIM1001 and KIMlOOlApfsG (mixed 1:1) was 
injected intravenously into BALB/C or C57BL/6 mice. The proportion of KIMlOOlApfsG mutants in the population of Y. pestis in the spleen was determined 2 
and 42 h postinfection by plating on TB supplemented with 2.5 mM CaCl 2 , 1% glucose, and 50 fJ,g/ml tetrazolium red. (B) The attenuated strains JG102 and 
TG102ApfsG were mixed 1:1 in the rich medium TB, in TB supplemented with 5 mM glucose, and in the defined medium SNM (n = 5 for each condition). The 
proportion of JG102ApfsG in each culture was measured before and after 42 h of log-phase growth. 



respect to siderophore production, the concentration of extracel- 
lular yersiniabactin is likely high enough to provide iron to mu- 
tants deficient in Ybt synthesis. 

Native Y. pestis plasmids. All three of the native plasmids in 
Y. pestis KIM 1001 have been shown to have virulence functions in 
mice. To measure the contribution of plasmid genes to bacterial 
fitness during infection, we calculated the in vivo relative fitness 
and significance for each annotated ORF on the plasmids as de- 
scribed above for chromosomal genes. 

pCDl and genes of the type III secretion system. The type III 
secretion system encoded on the 70-kb plasmid pCDl is essential 
for virulence of Y. pestis. We observed very strong reductions in 
relative fitness of mutants defective for many T3S genes (see Ta- 
ble S4 in the supplemental material). For example, yscF, which 
encodes the major structural subunit of the needle, was classified 
as strongly selected in vivo (relative fitness = 0; see Table S4). IcrV, 
yopB, and yopD, encoding the translocon apparatus, also had large 
reductions in relative fitness (0, 0.55, and 0.69). Among the se- 
creted effectors, insertions in the genes encoding a GTPase- 
activating enzyme {yopE), a tyrosine phosphatase (yopH), and a 
regulatory effector iyopK) suffered clearly significant reductions 
in relative fitness (Fig. 7; see also Table S4). We did not detect a 
reduction in fitness for insertions in the effector-encoding genes 
yopM (encoding a possible inhibitor of infiammasome activa- 
tion), ypM (encoding a serine/threonine kinase), otyopT (encod- 
ing a cysteine protease). Insertions in yopj (encoding an acetyl- 
transferase) showed weak but not significant positive selection. 
These results are consistent with previous reports that some effec- 
tors are crucial while others appear to contribute very modestly or 
not at all to virulence (48-53). 

pMTl. No genes on pMTl experienced significant selection in 
vivo. pMTl carries a gene encoding a phospholipase D, ymt 
(Y1069), that was initially thought to contribute to virulence in 
mice; however, the lack of apparent selection on ymt is in accor- 
dance with more recent reports that ymt primarily promotes bac- 
terial survival in the flea vector (54) rather than enhancing viru- 
lence in mammals (55). pMTl also carries a gene, cafl (Y1100), 
encoding the fraction 1 capsule protein, which produces abundant 
pili loosely associated with the cell surface. As with ymt, we did not 
detect significant selection of cafl in vivo. 



pPCPl. None of the five annotated ORFs on pPCPl experi- 
enced significant selection in vivo. pPCPl encodes a single char- 
acterized virulence factor, the plasminogen activator encoded by 
pla, which plays a crucial role in progression to systemic infection 
following subcutaneous or intradermal infection (41, 56) but is 
less important in pulmonary infection (57) and has little effect 
following intravenous infection (58). Failure of pla mutants to 
show a fitness deficit in the Tn-seq data set is thus not surprising. 
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FIG 7 Many genes on the type III secretion plasmid pCDl undergo strong 
selection during infection. Each gene on pCDl was compared between two in 
vitro data sets (filled pale blue squares) or between an in vitro data set and the 
in vivo data set (gray circles). Genes experiencing significant selection in vivo 
are represented by black circles. Genes harboring no sequenced insertions in 
vivo in the central 90% of the ORF (relative fitness = 0) are shown in the box 
at the left of the plot. The effector proteins of the type III secretion system 
undergo variable levels of selection (red, yopH; yellow, yopE; dark blue, yopK; 
orange, ypkA; purple, yopM; pink, yopT; green, yopj), while the ATPase (tan, 
yscN) and structural components (tan, yscF) undergo strong selection. 
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DISCUSSION 

We detected a substantial fraction of the genes previously impli- 
cated in the virulence of Y. pestis (see Table S5 in the supplemental 
material). Some of the virulence genes not detected have been 
shown to be important via subcutaneous but not intravenous in- 
fection (e.g., pla and yadBC), while others may well have this 
property (ail, cafl, and psa; see Table S5). Alternatively, intercel- 
lular complementation may mask the requirement for cafl and/or 
psa, since both of these genes encode highly abundant secreted 
proteins. Our results are also generally in agreement with the re- 
cent study by Crimmins et al, which reports Tn-seq data for the 
closely related species Y. pseudotuberculosis following intravenous 
infection (25). As in our study, these investigators found crucial 
roles in vivo for purine biosynthesis (purM, purH, purC, and 
purD), aromatic amino acid biosynthesis {aroA and aroE), and 
LPS modification (arnD) (25). The remaining differences in our 
results may reflect biological differences between these two organ- 
isms, which differ in requirements for growth in vitro and cause 
very different diseases, but also reflect the increased sensitivity and 
resolution of our experiments, which employed a 100-fold-larger 
library. 

A comparison of the Tn-seq data set to the in vivo competition 
data for a defined panel of Y. pestis mutants recently published by 
Pradel et al. (15) reveals several genes with reduced fitness in both 
experiments. Interestingly, however, there remains considerable 
disagreement between the candidates identified by Pradel et al. 
and results of the Tn-seq approach described here. In particular, 
Pradel et al. found thatpfsG mutants are attenuated in a subcuta- 
neous single-infection model, whereas we found that ptsG mu- 
tants remained fully virulent. It is possible that the low dose used 
by Pradel et al, which they report as 10 CFU, may explain this 
discrepancy. 

Many genes experiencing significant in vivo selection during 
the Tn-seq experiment have roles in nutrient import or biosyn- 
thesis. The widespread use of rich media containing nutritional 
conditions not found in the host in both clinical and research 
settings may partly explain why these genes are often overlooked 
as therapeutic targets. As an illustration of this principle, the poor 
fitness of brnQ mutants in vivo cannot be detected on the rich 
medium TB but is readily apparent in media specifically designed 
to mimic nutritional conditions in serum. As illustrated by the 
strong immunity induced by infection with our brnQ mutant, 
importers required for nutrition in vivo may also be useful for the 
rational design of strains to be used as live vaccines. 

Even in media recapitulating the in vivo environment, a growth 
defect in vitro does not always translate to reduced virulence. The 
behavior of brnQ mutants versus that ofptsG mutants makes clear 
the important distinction between fitness and virulence: the de- 
tection of reduced relative fitness by Tn-seq does not necessarily 
imply attenuation of the corresponding mutant. In vivo fitness 
and virulence can be distinct properties, especially when the fit- 
ness reduction is small. 

With a few exceptions, Tn-seq studies reported to date present 
the observed effect of mutants in a given gene either as a ratio of 
the raw frequencies (e.g., see reference 22) or the log base 2 of this 
ratio (e.g., see reference 21). As we have done here, van Opijnen 
and Camilli (35) use the metric of relative fitness. We suggest the 
adoption of this measure as a standard for the presentation of 
Tn-seq results. The ratio of raw frequencies yields extreme values 



that can be misleading. Log base 2 of these ratios is proportional to 
relative fitness, but the constant of proportionality will differ de- 
pending on experimental details, and hence this metric may not be 
directly comparable among studies. Relative fitness suffers from 
neither of these complications and has several additional advan- 
tages: most importantly, its readily interpreted biological meaning 
and its congruence with the formal definitions for the synergy or 
independence of mutational effects. The latter is especially useful 
in genetic interaction studies, in which libraries constructed in 
specific mutant backgrounds are to be compared with libraries 
constructed in the wild-type parent. 

The wealth of available data on Yersinia pestis pathogenesis 
enables the critical evaluation of the Tn-seq method in the context 
of infection studies. For example, the intercellular complementa- 
tion of the yersiniabactin siderophore system suggests that inter- 
cellular complementation may be a common phenomenon in Tn- 
seq and related methods. In support of this conclusion, Price et al. 
observed strong intercellular complementation of the Y. pestis 
T3SS in lung tissue when they applied an alternate technique for 
screening transposon mutant libraries (TraSH) (59). The pattern 
we observe in the yersiniabactin system — a cluster of genes de- 
voted to synthesis of a secondary metabolite not under selection, 
coupled to a transporter under strong selection — may be a signa- 
ture for intercellular complementation by a highly diffusible small 
molecule. In the context of pathogenesis, a lack of an observed 
fitness deficit in Tn-seq studies cannot be interpreted as definitive 
evidence that the gene in question would not be important, or 
even essential, if the mutant were in pure culture. 

The requirement for a large inoculum to provide good cover- 
age of the genome is perhaps the most important limitation of the 
Tn-seq method. One of the most important properties of Y. pestis 
is its ability to produce systemic infection following intradermal 
inoculation with a few bacteria; however, this problem cannot be 
practically addressed by Tn-seq due to bottleneck effects that re- 
duce library diversity. As the work by Pradel et al. illustrates, a 
genomic-scale approach to identifying the genes required for in- 
fection via a peripheral route would be practicable only with an 
ordered array library orders of magnitude smaller than the ran- 
dom library used here and even then would require a large num- 
ber of mice to limit inoculum size and overcome mouse-to-mouse 
variation (15). Nonetheless, as we have shown here, even very 
large inocula (in our case, more than 2 million LD 50 s) can yield a 
wealth of data that accurately reflect the underlying biology estab- 
lished over many years for Y. pestis. The in vivo application of 
Tn-seq is clearly an approach of extraordinary power that should 
greatly accelerate the understanding of many pathogens. Develop- 
ment of adequate libraries may be a significant obstacle in many 
species, but effort devoted to overcoming this obstacle is very 
likely to be a worthwhile investment. 

MATERIALS AND METHODS 

Bacterial strains and growth conditions. Strains used in this study and 
details of strain construction are presented in Text S 1 in the supplemental 
material. Unless otherwise specified, E. coli strains were cultured in Luria 
broth (LB) and Y. pestis strains in the rich medium TB, prepared to max- 
imize plating efficiency as previously described (60). Additional experi- 
ments were performed in the defined media PMH2 (42) or SNM (see 
Text SI). All Y. pestis cultures were supplemented with 2.5 mM CaCl 2 to 
suppress type III secretion. Where appropriate, media were supplemented 
with 100 ju,g/ml ampicillin, 25 fig/ml zeocin, or 25 fig/ml diaminopimelic 
acid. 
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Library construction and Tn-seq. Experimental details of library gen- 
eration, selection, preparation for Illumina sequencing, sequence read 
processing, and statistical analysis are available in Text SI in the supple- 
mental material. Raw sequence reads are available in the NCBI Sequence 
Read Archive (SRA) (http://www.ncbi.nlm.nih.gov/Traces/sra) under 
BioProject PRJNA240677, accession numbers SRX483428 (reference in 
vitro library), SRX483566 (replicate in vitro library), and SRX483575 {in 
vivo library). 

Animal infections. This study was conducted in conformity with the 
Guide for the Care and Use of Laboratory Animals of the National Insti- 
tutes of Health and with the review and approval of the University of 
Massachusetts Medical School Institutional Animal Care and Use Com- 
mittee (IACUC). 

C57BL/6 and BALB/C mice were infected as appropriate. All bacterial 
cultures used for inoculation were grown at 37°C on TB agar (TB medium 
with 1.5% agar) containing 2.5 mM CaCl 2 overnight prior to infection. 
Bacterial cells were diluted in infection-grade phosphate-buffered saline 
(PBS) to the desired concentration. In every case, the number of viable 
bacteria present in each dose was verified by dilution plating of the inoc- 
ulum. Mice were monitored every 12 h for signs of illness, such as ruffled 
fur, shallow breathing, limping, reluctance to move, and swollen lymph 
nodes. 

ptsG competition experiments. Mice were sacrificed 2 h or 42 h fol- 
lowing coinfection with 3 X 10 3 CFU of a 1:1 mixture of KIM1001 and 
KIMlOOlApfsG. Spleens were removed and homogenized in 0.5 ml sterile 
PBS. Dilutions were plated on TB agar containing 2.5 mM CaCl 2 , 1% 
glucose, and 50 u,g/ml tetrazolium red. On this medium, the wild-type 
strain KIM 1001 forms white colonies, while colonies of KIMlOOlApfsG 
are purple (61). In vitro competition experiments were performed by mix- 
ing JG102 and JG102ApfsG 1:1 and plating cultures on TB agar containing 
2.5 mM CaCl 2 , 1% glucose, and 50 /ug/ml tetrazolium red after 0 h or 42 h 
of continuous log-phase growth at 37°C. The fraction of JG102ApfsG 
colonies was determined by calculating the fraction of purple colonies on 
these dilution plates. 

Because the JG102 strain and the complemented strain 
JG102ApfsG(pZS*13luc::pfsG) both ferment glucose, both form white 
colonies on TB agar supplemented with 1% glucose and 50 fig/ ml tetra- 
zolium red. In vitro competition experiments with the complemented 
strain were therefore performed against the control strain, 
JG102Axyffi(pZS*131uc), cultured in the presence of 100 /ug/ml ampicil- 
lin. Dilutions were plated on TB agar supplemented with 2.5 mM CaCl 2 , 
1% xylose, and 30 /u,g/ml neutral red after 0 h or 42 h of continuous 
log-phase growth at 37°C. On these plates, the complemented strain, 
JG102ApfsG(pZS*131uc::pfsG), forms white colonies, and the control 
strain, JG102Axy/B(pZS*131uc), forms red colonies. The fraction of 
JG102ApfsG(pZS*131uc::pfsG) colonies was determined by calculating 
the fraction of red colonies on these dilution plates. 

The change in the growth rate was determined by plotting the natural 
log of the ratio of the ptsG mutant to the wild-type strain over time and 
calculating the slope by linear regression. Absolute change in growth rate 
was applied to the measured growth rate for the wild-type strain to deter- 
mine mutant growth rates. 
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